Picture for Yuki Mitsufuji

Yuki Mitsufuji

GUDA: Counterfactual Group-wise Training Data Attribution for Diffusion Models via Unlearning

Add code
Jan 30, 2026
Viaarxiv icon

Summary of The Inaugural Music Source Restoration Challenge

Add code
Jan 07, 2026
Viaarxiv icon

Improved Object-Centric Diffusion Learning with Registers and Contrastive Alignment

Add code
Jan 03, 2026
Viaarxiv icon

Do Foundational Audio Encoders Understand Music Structure?

Add code
Dec 19, 2025
Viaarxiv icon

AutoRefiner: Improving Autoregressive Video Diffusion Models via Reflective Refinement Over the Stochastic Sampling Path

Add code
Dec 15, 2025
Viaarxiv icon

Schrodinger Audio-Visual Editor: Object-Level Audiovisual Removal

Add code
Dec 14, 2025
Viaarxiv icon

PAVAS: Physics-Aware Video-to-Audio Synthesis

Add code
Dec 09, 2025
Figure 1 for PAVAS: Physics-Aware Video-to-Audio Synthesis
Figure 2 for PAVAS: Physics-Aware Video-to-Audio Synthesis
Figure 3 for PAVAS: Physics-Aware Video-to-Audio Synthesis
Figure 4 for PAVAS: Physics-Aware Video-to-Audio Synthesis
Viaarxiv icon

Coherent Audio-Visual Editing via Conditional Audio Generation Following Video Edits

Add code
Dec 08, 2025
Viaarxiv icon

MeanFlow Transformers with Representation Autoencoders

Add code
Nov 17, 2025
Viaarxiv icon

FoleyBench: A Benchmark For Video-to-Audio Models

Add code
Nov 17, 2025
Figure 1 for FoleyBench: A Benchmark For Video-to-Audio Models
Figure 2 for FoleyBench: A Benchmark For Video-to-Audio Models
Figure 3 for FoleyBench: A Benchmark For Video-to-Audio Models
Figure 4 for FoleyBench: A Benchmark For Video-to-Audio Models
Viaarxiv icon